SEQUENCE LISTING




(1) INFORMATION FOR SEQ ID NO: 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: cDNA

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 1
Seq ID No. 1: Sequence of 31.2 clone:

5'-ATCGCCGTAA TTGCCATGTT TTCCCTCTCA CCGGAATCCT ACGTTATCC
CCTTACCTTC GTGAACATTA CAGTAGGAAT CGGTGGTCCA ATTATCAACT
TAATTTTGGG CGCATCTGTT CGTGTTAACT AGAAGCCATG TATACATACA
ATACAACATG GTTCACTCCT CCTACAGATT ATGAGTTGAA CTTTTATAAT
AAGTTGTAAT AATGGCTTCT GAATAAGGAG AAGAGGAGCC TCTGTTTGTT
TTACTTATTA CAGATGTGAT ATCGTTCAAC AACTTTGATT CTGCGAAAAA
AAAAA-3'

(2) INFORMATION FOR SEQ ID NO: 2

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 2
Seq ID No. 2: Sequence of 21.2 clone:
5'-AGAAGTACCT GAAAGGAAGC TTAACGAGGT GAACATCCAT
TGCAGCCAGC CCTGGAATCT GTACAGGGCA ACTCTGAACC GGAATTATTT
TAATAACCCG TGGGCAATGA TTGCAATTAT GGCTCGTTTG GTATTACTTC
TACTCACTTA GACACAACTG TATTTACGGT TTTCGCTGGA ATTGTAATTG
TTGGAGCGAC AAAATAGATG GTCACAACTT ATTGGTGAGA GTATCAGTGT
GCTCTTCTTT ATCGTCTTTA ACTCTCCGTG GTAATTACTT TGACAATATT
CATACAT-3'
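The LENGTH entries above can be cross-checked against the printed sequences. The following is a minimal sketch, not part of the listing itself: `count_bases` is a hypothetical helper name, and it assumes the sequence text uses only the letters A, C, G, and T for bases, so that end markers, spaces, and line breaks can be treated as layout.

```python
import re


def count_bases(sequence_text: str) -> int:
    """Count nucleotide letters in a printed sequence, ignoring layout.

    Hypothetical helper: assumes only A, C, G, T (upper or lower case)
    represent bases. End markers such as 5'- and -3', grouping spaces,
    and line breaks contain none of those letters and are skipped.
    """
    return len(re.findall(r"[ACGTacgt]", sequence_text))


# Two groups of ten bases plus a five-base tail, split over two lines.
example = "5'-ATCGCCGTAA TTGCCATGTT\nAAAAA-3'"
print(count_bases(example))
```

A count that differs from the stated LENGTH suggests that a base was dropped or added when the text was transcribed.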
SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT:

(ii) TITLE OF INVENTION:

(iii) NUMBER OF SEQUENCES:

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE:

(B) STREET:

(C) CITY:

(D) STATE:

(E) COUNTRY:

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE:

(B) COMPUTER:

(C) OPERATING SYSTEM:

(D) SOFTWARE:

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME:

(B) REGISTRATION NUMBER:

(C) REFERENCE/DOCKET NUMBER:

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE:

(B) TELEFAX:

(C) TELEX:

(2) INFORMATION FOR SEQ ID NO: X

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH:

(B) TYPE:

(C) STRANDEDNESS:

(D) TOPOLOGY:

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: X
SPECIMEN SEQUENCE LISTING

<110> Smith, John; Smithgene Inc.

<120> Example of a Sequence Listing

<130> 01-00001

<140> PCT/EP98/00001
<141> 1998-12-31

<150> US 08/999,999
<151> 1997-10-15

<160> 4

<170> PatentIn version 2.0

<210> 1
<211> 389
<212> DNA
<213> Paramecium sp

<220>
<221> CDS
<222> (279)...(389)

<300>
<301> Doe, Richard
<302> Isolation and Characterization of a Gene Encoding a
      Protease from Paramecium sp.
<303> Journal of Genes
<304> 1
<305> 4
<306> 1-7
<307> 1988-06-31
<308> 123456
<309> 1988-06-31

<400> 1
agctgtagtc attcctgtgt cctcttctct ctgggcttct caccctgcta atcagatctc 60

agggagagtg tcttgaccct cctctgcctt tgcagcttca caggcaggca ggcaggcagc 120

tgatgtggca attgctggca gtgccacagg cttttcagcc aggcttaggg tgggttccgc 180

cgcggcgcgg cggcccctct cgcgctcctc tcgcgcctct ctctcgctct cctctcgctc 240

ggacctgatt aggtgagcag gaggaggggg cagttagc atg gtt tca atg ttc agc 296
                                          Met Val Ser Met Phe Ser
                                          1               5

ttg tct ttc aaa tgg cct ??? ttt tgt ttg ttt gtt tgt ttg ttc caa 344
Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu Phe Val Cys Leu Phe Gln
            10                  15                  20

tgt ccc aaa gtc ctc ccc tgt cac tca tca ctg cag ccg aat ctt 389
Cys Pro Lys Val Leu Pro Cys His Ser Ser Leu Gln Pro Asn Leu
        25                  30                  35

<210> 2
<211> 37
<212> PRT
<213> Paramecium sp

<400> 2
Met Val Ser Met Phe Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu
1               5                   10                  15

Phe Val Cys Leu Phe Gln Cys Pro Lys Val Leu Pro Cys His Ser Ser
            20                  25                  30

Leu Gln Pro Asn Leu
        35

<210> 3
<211> 11
<212> PRT
<213> Artificial Sequence

<220>
<223> Designed peptide based on size and polarity to act as a
      linker between the alpha and beta chains of Protein XYZ.

<400> 3
Met Val Asn Leu Glu Pro Met His Thr Glu Ile
1               5                   10

<210> 4
<400> 4
000
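The feature location and the protein length in the specimen are tied together by simple arithmetic: a CDS located at (279)...(389) spans 389 - 279 + 1 = 111 bases, which is 37 codons, the length given for the PRT sequence. The sketch below illustrates that check. The function name `cds_residue_count` is hypothetical, and it assumes the location is written as "(first)...(last)" and that the range holds only whole coding codons.

```python
import re


def cds_residue_count(location: str) -> int:
    """Return the number of codons covered by a "(first)...(last)" location.

    Hypothetical helper: assumes an inclusive, 1-based range that contains
    only complete codons.
    """
    match = re.fullmatch(r"\s*\((\d+)\)\s*\.{2,3}\s*\((\d+)\)\s*", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    first, last = int(match.group(1)), int(match.group(2))
    span = last - first + 1  # inclusive range
    if span <= 0 or span % 3 != 0:
        raise ValueError(f"range of {span} bases is not a whole number of codons")
    return span // 3


print(cds_residue_count("(279)...(389)"))  # 111 bases / 3 = 37
```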
Numeric identifier; Definition; Comments and format; Mandatory (M) or optional (O)

<110> Applicant
      Comments and format: Preferably max. of 10 names; one name per line; preferable format: Surname, Other Names and/or Initials.
      M.

<120> Title of Invention
      M.

<130> File Reference
      Comments and format: Personal file reference.
      M when filed prior to assignment of appl. number.

<140> Current Application Number
      Comments and format: Specify as: US 07/999,999 or PCT/US96/99999.
      M, if available.

<141> Current Filing Date
      Comments and format: Specify as: yyyy-mm-dd.
      M, if available.

<150> Prior Application Number
      Comments and format: Specify as: US 07/999,999 or PCT/US96/99999.
      M, if applicable; include priority documents under 35 USC 119 and 120.

<151> Prior Application Filing Date
      Comments and format: Specify as: yyyy-mm-dd.
      M, if applicable.

<160> Number of SEQ ID NOs
      Comments and format: Count includes total number of SEQ ID NOs.
      M.

<170> Software
      Comments and format: Name of software used to create the Sequence Listing.
      O.

<210> SEQ ID NO:#:
      Comments and format: Response shall be an integer representing the SEQ ID NO shown.
      M.

<211> Length
      Comments and format: Respond with an integer expressing the number of bases or amino acid residues.
      M.

<212> Type
      Comments and format: Whether presented sequence molecule is DNA, RNA, or PRT (protein). If a nucleotide sequence contains both DNA and RNA fragments, the type shall be "DNA." In addition, the combined DNA/RNA molecule shall be further described in the <220> to <223> feature section.
      M.

<213> Organism
      Comments and format: Scientific name, i.e. Genus/species, Unknown or Artificial Sequence. In addition, the "Unknown" or "Artificial Sequence" organisms shall be further described in the <220> to <223> feature section.
      M.

<220> Feature
      Comments and format: Leave blank after <220>. <221-223> provide for a description of points of biological significance in the sequence.
      M, under the following conditions: if "n," "Xaa," or a modified or unusual L-amino acid or modified base was used in a sequence; if ORGANISM is "Artificial Sequence" or "Unknown"; if molecule is combined DNA/RNA.

<221> Name/Key
      Comments and format: Provide appropriate identifier for feature, preferably from WIPO Standard ST.25 (1998), Appendix 2, Tables 5 and 6.
      M, under the following conditions: if "n," "Xaa," or a modified or unusual L-amino acid or modified base was used in a sequence.

<222> Location
      Comments and format: Specify location within sequence; where appropriate state number of first and last bases/amino acids in feature.
      M, under the following conditions: if "n," "Xaa," or a modified or unusual L-amino acid or modified base was used in a sequence.

<223> Other Information
      Comments and format: Other relevant information; four lines maximum.
      M, under the following conditions: if "n," "Xaa," or a modified or unusual L-amino acid or modified base was used in a sequence; if ORGANISM is "Artificial Sequence" or "Unknown"; if molecule is combined DNA/RNA.

<300> Publication Information
      Comments and format: Leave blank after <300>.
      O.

<301> Authors
      Comments and format: Preferably max. of ten named authors of publication; specify one name per line; preferable format: Surname, Other Names and/or Initials.
      O.

<302> Title
      O.

<303> Journal
      O.

<304> Volume
      O.

<305> Issue
      O.

<306> Pages
      O.

<307> Date
      Comments and format: Journal date on which data published; specify as yyyy-mm-dd, MMM-yyyy or Season-yyyy.
      O.

<308> Database Accession Number
      Comments and format: Accession number assigned by database, including database name.
      O.

<309> Database Entry Date
      Comments and format: Date of entry in database; specify as yyyy-mm-dd or MMM-yyyy.
      O.

<310> Patent Document Number
      Comments and format: Document number; for patent-type citations only. Specify as, for example, US 07/999,999.
      O.

<311> Patent Filing Date
      Comments and format: Document filing date, for patent-type citations only; specify as yyyy-mm-dd.
      O.

<312> Publication Date
      Comments and format: Document publication date, for patent-type citations only; specify as yyyy-mm-dd.
      O.

<313> Relevant Residues
      Comments and format: FROM (position) TO (position).
      O.

<400> Sequence
      Comments and format: SEQ ID NO should follow the numeric identifier and should appear on the line preceding the actual sequence.
      M.
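As an illustration of the formats in the table above, the following minimal sketch reads lines of the form "<NNN> value" and checks a date given as yyyy-mm-dd. It is an assumption-laden example rather than an official parser: the names `parse_identifiers` and `is_valid_date` are hypothetical, and each identifier is assumed to sit on one line with its value.

```python
import re
from datetime import date

# A three-digit numeric identifier in angle brackets, then an optional value.
IDENTIFIER_LINE = re.compile(r"^<(\d{3})>\s*(.*)$")


def parse_identifiers(text):
    """Return (identifier, value) pairs in order of appearance.

    Hypothetical helper: lines that do not start with <NNN> are skipped.
    """
    pairs = []
    for raw_line in text.splitlines():
        match = IDENTIFIER_LINE.match(raw_line.strip())
        if match:
            pairs.append((match.group(1), match.group(2).strip()))
    return pairs


def is_valid_date(value):
    """True if value is a real calendar date written as yyyy-mm-dd."""
    match = re.fullmatch(r"(\d{4})-(\d{2})-(\d{2})", value)
    if match is None:
        return False
    try:
        date(int(match.group(1)), int(match.group(2)), int(match.group(3)))
    except ValueError:
        return False  # right shape, but no such day in the calendar
    return True


sample = "<140> PCT/EP98/00001\n<141> 1998-12-31\n<160> 4\n"
for identifier, value in parse_identifiers(sample):
    print(identifier, value)
print(is_valid_date("1998-12-31"))
```

A strict calendar check of this kind rejects a value such as 1988-06-31, which has the required shape but names a day that does not exist, so a shape-only check may be preferable where such values must be accepted.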



